POOCLAPACK: Parallel Out-of-Core Linear Algebra Package

نویسندگان

  • Wesley C. Reiley
  • Robert A. van de Geijn
چکیده

In this paper parallel implementation of out-of-core Cholesky factorization is used to introduce the Parallel Outof-Core Linear Algebra Package (POOCLAPACK), a flexible infrastructure for parallel implementation of out-of-core linear algebra operations. POOCLAPACK builds on the Parallel Linear Algebra Package (PLAPACK) for in-core parallel dense linear algebra computation. Despite the extreme simplicity of POOCLAPACK, the out-of-core Cholesky factorization implementation is shown to achieve in excess of 80% of peak performance on a 64 node configuration of the Cray T3E-600. Preliminary results from the HP Exemplar X-Class that demonstrate the portability of POOCLAPACK are also given.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Anatomy of a Parallel Out-of-Core Dense Linear Solver

{ In this paper, we describe the design and implementation of the Platform Independent Parallel Solver (PIPSolver) package for the out-of-core (OOC) solution of complex dense linear systems. Our approach is unique in that it allows essentially all of RAM to be lled with the current portion of the matrix (slab) to be updated and fac-tored, thereby greatly improving the computation to I/O ratio o...

متن کامل

Advanced Complex Trait Analysis

MOTIVATION The Genome-wide Complex Trait Analysis (GCTA) software package can quantify the contribution of genetic variation to phenotypic variation for complex traits. However, as those datasets of interest continue to increase in size, GCTA becomes increasingly computationally prohibitive. We present an adapted version, Advanced Complex Trait Analysis (ACTA), demonstrating dramatically improv...

متن کامل

Symmetric Indefinite Linear Solver using OpenMP Task on Multicore Architectures

Recently, the Open Multi-Processing (OpenMP) standard has incorporated task-based programming, where a function call with input and output data is treated as a task. At run time, OpenMP’s superscalar scheduler tracks the data dependencies among the tasks and executes the tasks as their dependencies are resolved. On a shared-memory architecture with multiple cores, the independent tasks are exec...

متن کامل

A survey of out-of-core algorithms in numerical linear algebra

This paper surveys algorithms that efficiently solve linear equations or compute eigenvalues even when the matrices involved are too large to fit in the main memory of the computer and must be stored on disks. The paper focuses on scheduling techniques that result in mostly sequential data accesses and in data reuse, and on techniques for transforming algorithms that cannot be effectively sched...

متن کامل

Research Accomplishments and Objectives

This document presents my research accomplishments and objectives as of Spring 2002. The document references and lists most of my publications, so it also serves as an annotated list of publications. In addition, Section 8 discusses my teaching and describes a textbook that I have written. Most of my recent and current research focuses on discrete and computer-architecture-related issues in num...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999